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Abstract 



We prove by symmetry properties that the Lagrangian of a free point- 
mass is a quadratic function of the speed in the non-relativistic case, and 
that the action of the free point-mass between two spacetime points is the 
proper time passed in the relativistic case. These well known facts are 
"^J ■ proved in a mathematically rigorous way with a frame independent treat- 

ment based on spacetime models introduced by Matolcsi. The arguments 
show that these results are not obvious at all, some common beliefs can 
be refuted by explicit counterexamples. In our treatment the similarity of 



^ non-relativistic and relativistic cases is apparent. 

£V Introduction 
43 



Spacetime symmetries play a fundamental role in physical theories. We can 
distinguish between two points of view regarding their application. If govern- 
ing equations are known, symmetries provide conserved quantities (symmetry 
charges) of the theory, conditions on their conservation and integrals of the 
governing equations. When the dynamics is given by variational principles the 
symmetries are exploited by Noether theorems. If governing equations are un- 
known, then symmetries are the basic tools to construct a proper dynamics 
(based on any kind of formulation) of the theory. The starting point is always 
to determine the Lagrangian of free systems (point-mass or field). A free sys- 
tem is invariant under all symmetries of the corresponding spacetime and it is 
completely determined by this property. 

In classical mechanics, where the governing equation - the Newton equation - 
is well known one can exploit symmetries by Noether theorems (see e.g. ||, [?], [jj ) . 
On the other hand, in quantum field theories symmetries are common tools to 
construct appropriate Lagrangians (see. e.g. 0). A similar - exceptional 
- approach in classical mechanics can be found in the books of Landau and 
Lifsic where the method is applied for finding the Lagrangian of a point-mass 
in non-relativistic dynamics |j| and partially also in relativistic dynamics Q. 

That derivation (and all similar subsequent derivations in field theories) 
are unsatisfactory from several points of view. Although the Lagrangians are 
coordinate free, they are observer and therefore frame dependent. Moreover, 
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it is not clear whether the action, the Lagrangian, the resulted Euler-Lagrange 
equations or the solutions of the Euler-Lagrange equations are the transformed 
objects that should be invariant under the symmetries. Different papers and 
textbooks give different answers, moreover these cases are frequently mixed (and 
contradictory) in the very same treatment. 

We shall see that if the Lagrangian or the action itself were required to be 
invariant for all spacetime symmetries (Noether or Poincare transformations), 
then we should get a constant function. Thus it is essential that instead of 
the Lagrangians, the invariant quantities should be the equivalence classes of 
the Lagrangians (two Lagrangians differing in a full time derivative are con- 
sidered equivalent). Somewhere in the literature this problem is obliterated by 
referring to the fact that two Lagrangians in the same equivalence class can be 
transformed into each other with a canonical transformation ||l7| , p26]. That is 
true, but it has nothing to do by invariance because we cannot transform the 
variables anyhow when investigating invariance under variable transformations 
(spacetime symmetries). 

The situation is even more confused in the (special) relativistic case where 
the observer dependence of the Lagrangian is more apparent (there is no frame 
independent time). There symmetry considerations are essentially ignored [^) 
with the argument that the only invariant scalar is proper time. If symmetry is 
taken into account, the invariance of the solutions of Euler-Lagrange equations 
is required (l8) resulting in a set of basically different Lagrange functions of a 
single free point-mass. 

In this paper only the dynamics of free point-masses is considered, therefore 
we require invariance under the full spacetime symmetry group. Our treatment 
can give surprising results constructing dynamics of interacting systems, when 
only partial spacetime symmetries arc required. First of all in our frame in- 
dependent spacetime models it is apparent that some of the partial spacetime 
symmetries are observer dependent (there is no time translation, time inversion 
etc. without a frame), therefore they cannot be considered as fundamental. In- 
dependently of the above mentioned property of partial symmetries, our more 
exact study gives some real surprises: we shall show e.g. by an explicit ex- 
ample that spacetime translation symmetry alone does not imply momentum 
conservation. 

In the present paper we investigate the precise restrictions of the possible 
forms of the Lagrangian of a free point-mass, implied by symmetry assump- 
tions. We consider two Lagrangians equivalent if they result in the same Euler- 
Lagrange equations i.e. differ in a full time derivative. We use arguments both 
in relativistic and non-relativistic case which are correct from a mathematical 
point of view as well. We show that the problem in these two cases can in 
fact be handled in a very similar way. We use the spacetime model constructed 



without observers and reference frames, described in Matolcsi |11, 12], as a gen- 



eral and sophisticated tool for our observations. The formalism is based on a 
clever exploitation of the affine structure of non-relativistic and special relativis- 
tic spacetimes giving a method that is similar to the traditional tensorial one. A 
differential geometric treatment like would be unnecessarily technical and 
not very well fitted to this problem. Another advantage of our treatment is 
to avoid misunderstandings based on the the well known problematic formu- 
lation of general covariance in non-relativistic and special relativistic theories 
||. Spacetime symmetries are frequently explained by the equivalence of iner- 
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tial reference frames; the free point-mass is said to be invariant under transla- 
tions, rotations and velocity transformations of the reference frames. Why just 
the movements of the free point-mass would be invariant under such changes? 
Working without reference frames of course excludes similar problems: space- 
time transformations are really acting on the spacetime position and velocity of 
the particle. 

2. Point-mass in non-relativistic spacetime 
2.1 The non-relativistic spacetime 

Here we recall some basic structures of the frame independent formalism de- 
scribed in Matolcsi |12]. All the vector spaces in question are real. The non- 
relativistic spacetime is (M, I,t, D,b), where 

- M is spacetime, an oriented four-dimensional afhne space over the vector 

space M, i.e. the difference of two points in M is an element of M. By 
orientation we mean a fixed ordering of basis vectors of M. 

- I is time, an oriented one-dimensional affine space over I. The latter is a 

vector space, the measure line of time intervals. By its orientation, we 
have positive and negative time intervals. Between any two moments of 
J, the time interval is an element of I. 

- t : M — ► I is an affine surjection, making correspondence between a point 

of M and its absolute time in I. r : M — > J is the linear surjection under 
t, connecting each vector to its time interval. These two functions are the 
time evaluation functions. 

- D is an oriented one-dimensional vector space, the measure line of distances. 

- b : E x E — > D ® D is & positive definite symmetric bilinear mapping, the 

Euclidean structure, where E : = kcrx is the subspace of the spacelike 



We obtain the usual coordinate-description by vectorizing M with an origin 
in M and a basis in M. Coordinates are denoted by roman letters, and can 
have values 0, 1, 2, 3. These are written in superscript, while coordinates of the 
dual space M* are put in subscripts. We use Einstein-convention for summing 
indices in superscripts and subscripts denoted by the same letter. Vectors in E 
have first coordinate zero, we denote the other three indices by Greek letters. 

The history of a point-mass is described by a world line, a connected curve 
with timelike tangents (tangents not in E). Such a world line can be given as the 
range of a world line function, a continuously differentiable function r : I — * M 
defined on an interval, with r(r(t)) = t for any t G Domr. Using a construction 
similar to the one of tensor products, one can define the four-dimensional tensor 
quotient space ^f-. The derivative of a world line function is f : I — > ^f- with 
r(r(t)) = 1. Hence its values are elements of 



vectors. 
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the set of absolute velocity values. V(l) is an affine space over y , and r(t) G y||y 
if r is twice differcntiable at t G I. 

We need the set of mass values, which we define as follows. J measures time 
lengths, it contains the second, while D measures distances, containing meter. 
For simplicity we choose h : = 1, hence mass values (e.g. 1 ° c "*j ) are in D ^ D , 
the measure line of mass values [tHl . 

2.2 Galilean and Noether transformations 

Noether transformations are the automorphisms of the spacetime. These trans- 
formations keep the structure of spacetime, and can be described as follows. 
The proper Galilean group is 

Q : = {L G Lin(Af, M) : L preserves orientation, r • L = t, 

L\ E + -L\ E = id E } 

acting on M , and rotating spacelike vectors. The proper Noether group (inho- 
mogeneous Galilean group) is 

TV : = {L : M — > M : L is affine, and the underlying L is an element of (/}. 

The word proper refers to the fact that time or space inversion is not contained 
in these groups. 

Spacetime translations i.e. transformations of the form x i— > x + a with a 
given a £ M are Noether transformations, whose underlying linear operator is 
the identity of M. Q and Af are a six dimensional and a ten dimensional Lie 
group, respectively. They have the Lie algebras 

La(0) = {H G Lin(M, M) : t H = 0, H\ E = -H\ E } and 
La(7V) = {H : M -> M : H affine, and the underlying H is in La(g)}, 

respectively. By its definition, an H G La({?) is in fact an M — > E linear map. 
The Lie algebra of the subgroup of spacetime translations consists of elements 
H for which H = 0. Such an affine map is constant, i.e. there is an h G M 
such that Hx = h for all x G Mw. 

Every Noether transformation L in a neighborhood of the unit element idM 
has the form 

(1) L = e^: = I + ± isH ^;- sH 

for some s G M and H G La(AT). The underlying Galilean transformation is 



e 

71=0 



E 



2.3 Variational principle for point-masses 
in non-relativistic spacetime 

According to variational principles of mechanics, the point-mass moves along a 
world line from a spacetime point xq to another one Xi, for which the "varia- 
tion" of an "action function" is zero, possibly caused by having extremum or 
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stationary value of the action function at this world line. In case of hamiltonian 
variational principles the initial and final spacetime points are fixed (the dura- 
tion and the initial and final space points are not varied). We formulate this as 
follows. 

Given a Lagrangian depending continuously on spacetime points and speed 
values, and mapping to the one-dimensional vector space *: 

£ : M x V(l) -> j, 
the action on an r world line function is 

S(r): = j £(r(t), f{t))dt 

to 

with to : = t(xq) and t\ : = t(xi). 

In order to use analysis arguments, we define differentiability of the function 
r i ► S(r). Taking a norm | \m on M and a norm | m on y (any two norms 
on a finite dimensional vector space are equivalent). We introduce the vector 
space 

(2) V:={r : [io,ti] — > E \ r is continuously differentiable, r(to)=r(ti)=0} 

endowed with the norm 

IH|:= max (\r(t)\ M + \r(t)\*A . 
te[t ,ti] V 1 > 

Then 



V := {r : [to, t{\ — > M \ r is a world line function, r(t ) = x , r{t\) = x\} 

is an affine space over V. Hence differentiability of S : V — > M is well-defined. 

If S is differentiable, the world line function realized by the point-mass is 
selected by 

DS(r) = e Lin(V, K), 

i.e. the derivative of S having value zero. This corresponds to the "action having 
variation zero" . It is well known, that if the Lagrangian £ is twice continuously 
differentiable, then S is differentiable, and in this case, DS(r) = is equivalent 
to twice continuous differentiability of r satisfying the Euler-Lagrange equation 

Dx£(r(t), r(i)) - ^D 2 £(r(t), r(t)) - 0. 

Here Di stands for the partial derivative according to the first variable in M, 
and D2 for the derivative according to the second variable in V(l). From now 
on, we denote elements of M by x, and elements of V(l) by u, therefore we 
write 

d£(x,u) . . , ^.^s d£(x,u) 

— ^ — - e Lin(M, E/J), — G Lm(E/I,R/I) 
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for the partial derivatives, respectively. 

By the construction of this variational principle, it is clear that adding a 
"full time-derivative" to £ only means adding a constant to S, hence leaving 
DS* invariant together with the world line realized. We give precise meanings of 
these notions. 

A function f : M x V^(l) — > 1 is called a full time-derivative if there exists a 
4> : M — ► R continuously differentiable function, such that f(x, u) = D</>(x)it for 
all x G M and u G V(l). For any world line function r, f o (r, f) = (<f>o r)' holds 
in this case, hence the action corresponding to £ and to £ + f only differ by a 
constant. We say that £ and £' are equivalent, if £' — £ is a full time-derivative. 
This relation determines equivalence classes on the set of Lagrangians. 

2.4 Symmetries and the Lagrangian 

A motion of a physical system happens at the same way before and after a 
transformation, if the solution of the Euler-Lagrange equation is not affected by 
the transformation. We assume that not only these solutions, but the derivative 
DS characterizes as well the physics of the system. Hence we say that a trans- 
formation is a symmetry of the system, if it leaves the derivative DS* invariant, 
i.e. it only turns the Lagrangian into an equivalent one. This is a rather re- 
strictive standpoint, there are several examples for variational principles where 
only the equivalence of the solutions is required e.g. in non-equilibrium thermo- 
dynamics where the governing equations cannot be derived from a variational 
principle, therefore the usual variational prescriptions are shaken up p5| , but 
also in mechanics there are attempts to find physical consequences of that fact 
investigating the so called s-equivalent systems and the references therein). 
Now we exclude transformations leaving the solution invariant, but multiplying 
£ by a constant for example. 

Let F : M — > M be a continuously differentiable map, for which t ■ 
DF(x)u ^ for any u e V(l). We say that F is a symmetry of the Lagrangian 
£, if there exists a full time-derivative fp such that 

(3) £ (Fx, ) r ■ DF(x)u = £(*, u) + f F (x, u) 

\ T ■ \jt (X)U J 

for any x £ M, u G 1^(1). The definition considers that the transformation can 
change the (absolute) time and assures that a transformed world line function 
remain a world line function after the transformation (e.g. the second variable of 
£ is an element of V(l) and the integration should change to leave DS invariant 
after a reparameterization of the time scale) . The definition is more transparent 
in case of symmetries that do not rescale the absolute time (r • DF(x)u = 1). 
The above definition is valid uniformly in special relativistic and non-relativistic 
considerations, too. Non-relativistic space-time symmetries do not rescale the 
(absolute) time. Especially, a proper Noether transformation is a symmetry of 
£, if and only if 

&(Lx, Lu) — £(x, u) + ]l{x, u), 

since DL(x) — L, and r • L = t, tu = 1 by definition of the proper Galilean 
group and of V(l). 
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2.5 Lagrangian of a free point-mass 

If a point-mass is free, i.e. it is not influenced by any effect, then we "feel" that 
the translated, rotated etc. form of its trajectory is also a possible trajectory for 
it. To be more precise, we could say that by applying a spacetime-automorphism 
on a trajectory selected by the variational principle, we obtain another trajectory 
satisfying that principle. It is still not a precise statement, since it is not clear 
how to understand a "free" point-mass, "not influenced by any effect". We 
reverse the situation, and accept this concept as a definition. 

We define a point-mass characterized by £ to be a free point-mass, if each 
proper Noether transformation is a symmetry of £. Hence £ is the Lagrangian 
of a free point-mass if and only if for any L E Af there exists a (f>L '■ M — > M, 
for which 

£,(Lx, Lu) — £(2;, u) — T>4>l{x)u. 

We introduce the notation 4>(L, x) := </>i(x), and we assume that (f> : Af x 
M — > E is smooth enough. Although Af x M is not an affine space thus <fi is 
defined on a manifold, we only consider one-parameter subgroups of Af, hence 
we can use the usual differentiability notions. We take the elements in the 
neighborhood of I : = id^/ in the form (Q), and we differentiate by the parameter 
s. Since 



de 



ds 



= H and 



s=0 



ds 



s=0 



we obtain 
(4) 

9£(x, u) . . <9£(x, zt) 9 2 ^(L, x) 

+ — 57 — W " _ 



dx du dLdx 



L=I 



iv \ du(H, x) 
= ' ~dx ' 



by letting s — > 0. 

In order to compare our frame independent formulae with those of usual 
treatments, we write the coordinated forms of our expressions. We coordinate 
M by an appropriate basis, we vectorial M by the map x 1— > x — o with a fixed 
point o E M, and we consider the vector-coordinates of these vectors. Then 
H(x) = H(x — o) + h, where h := H(o) E M. If H has coordinates 
then H(x) has coordinates H l jX^ + h % . Let us remark here that sometimes one 
think on coordinates as a convenient tool for expressing tensorial calculations 
without the corresponding reference frames (see the concept of "abstract in- 
dexes" of Wald @|). However, a formulation of general covariance (observer 
independence) with an observer dependent notation easily can lead to misinter- 
pretation because a frame independent equation can lead to a formula containing 
observer dependent quantities in a particular reference frame (especially in non- 
relativistic spacetime, see the debate on the covariance of the kinetic theory e.g. 

[io| , [l3| p^[|). Therefore, although there is no convenient notation to book 
the different transposes of higher order tensors without indexes, it is important 
to formulate the results of the calculations in our frame independent notation, 
too. 

V(l) is an affine subspace in ^j-, the four coordinates of its elements are 
not independent, i.e. the zeroth coordinates are 1 in this space. Hence the 
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derivatives by elements of V(l) only contain indices 1, 2, 3. We will distinguish 
these possibilities in the notation. Greek letters are in {1, 2, 3} and Latin indexes 
in {0,1,2,3}. Double indexes denote summation. Therefore, the coordinated 
form of (PI) is 



(?£ , • ■ • . <9£ j doj 
— {H l ,x J + h l ) + ^—H a „u J = — 
dx l 3 du a dx 1 



without writing the arguments of our functions. 

First we consider the special case of spacetime-translations. Then H = 0, 
hence H(x) = h € M is the same constant for each x £ M. Therefore, we can 
write (Q) in the form 

d£,(x, u) du>(h, x) <9£ . dcv i 

dx dx dx l dx 1 

The left hand-side is linear in h, the right hand-side is linear in u. Hence the 
other sides also have these properties. Thus it follows that there are functions 
I, f : M —> M* for which 

ox ox 1 ox % 



and 



Therefore, 



_..«(.), |£ = |i 

ox ox 1 ox 1 



dli dfj 



(5) Dl(x) = (Dfnx) , ^ = ^. 

Assuming twice differentiability of £, we differentiate (JsJ) by x k . Changing 
the order of differentiation and applying (|^) again we obtain by Young's theorem 

d 2 f, d% d% d 2 f k 



dx k dx l dx k dxi dx : >dx k dx^dx 1 
As a result, we can get that 

d_(9f L _ dh 

dx 1 V dx k dxi 



D(Df-(D f y)(x) = 0, ]= . 



Hence introducing the antisymmetric linear map C : M — > M* with com- 
ponents Cjk, we obtain 

(6) C : = — D A / : = Df - (D/)* = const. , C jk : = - || = const. 



Therefore 

(7) =£>/(*) + C* , ^L = ?/l + C kj . 

ox k dx k 

We conclude 

£(&, u) = f(x) ■ u+ (x — o) ■ C ■ u + <pf(u) , £ = f(x) k u k + x k C k jU J + ipf(u), 



with an arbitrary function ipf : V(l) - 
also 

£(x, u) = l(x) ■ u + <pi{u) 

with an arbitrary function ipi : V(l) 
spacetime translation symmetry that 

(8) £(x, u) = l(x) ■ u + (p(u) , 

where I is a function satisfying 



1/ 1. Exploiting (Q) one may write 

£ = l(x) k u k + <pi(u), 
R/J. Hence we obtained from the 

£ = l(x) k u k + ip(u), 



(9) 



D A I = C = const. 



9% 



d 2 L 



dx k dx l dx k dxi 



This is all we can say; we could not prove the statement of (|, page 13]; 
from spacetime translation symmetry does not follow the independence of the 
Lagrangian of spacetime variables. 

Now we consider general Noether transformations, and substitute the form 
(ph of the Lagrangian into (Eh (recall that H is a map M — » E): 



dl(x) ■ u TT , , „ x „ dc9(w) Tr 



i.e. 



<9;F 



9Z fe 



dl/ 



(10) i^(tfV + ^ + WT"*U* + ^H« k U k = 



du(H,x) 
dx 



u, 



Differentiating this by x m leads to 



i)2 ' k -u k (W^ +h i ) + ^u k H* 
' dx 1 



01. 



a tjo k 



dx m dx l v J '~ " ' dx % '" dx m ^ dx m dx k 
The variable u is only present linearly in this equation, hence we can omit it: 



(ii) „ ; <//>' + w) + 

v ' dx m dx lX 3 ' dx 1 



dig 
dx m 



H° 



d 2 u 



dx m dx k 



We transpose the equation (consider it with the indices k and m interchanged) , 
and subtract it from the original form ([ll]) . Then the right hand-side is zero by 
Young's theorem, and the first term on the left hand-side disappears due to (0) : 



dl k 



dx 



-H l 



dl m 

dx % 



jji dig 
Hk+ dx^ H 



OX K 



As we know, H is a linear map M — > E, hence H 
sum over indices a = 1, 2, 3 instead of i = 0, 



o. 



dx' 



■H a 7 



dl dl 

^H a k + —H a k 
dx a dx m h 



0. 

0. Therefore, we can 
3 contained in the expressions 

dig 



dx k 



H' 



0. 



(12) 



Ck.gH 



kgil m 



Cnm.H k — 0. 
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We know that restricting H to E results in an antisymmetric map, and by 
the identification E* = D ^ D , we can interchange subscripts and superscripts 
of spacelike vectors: H a p = —Hp 01 . Thus separating the cases k = fi = 1,2,3 
and k = 0, we obtain for m = lo = 1, 2, 3, 

(13) Cp a H a LJ — C bja H a f3 = 0, 



(14) CoaH"^ + C LUa H a — 0. 

Let i : E — > Af be the embedding map; its transpose i* : M* — > E* is a 

* E 

linear surjection. Then Ce '■= i*Ci : E — > E* = — — — is an antisymmetric 

linear map. Equation (^3|) tells us that this commutes with any antisymmetric 
linear map H\e '■ E —> E: 

[C E , H\ e ] = 0. 

As a consequence, Ce commutes with all the elements of the rotation group of 
E, hence by Schur's lemma, it is a multiple of id.E, meanwhile it is antisymmet- 
ric. This is only possible if 

Ce = , C a (i = 0. 
Then, by (0), Co a H a u — for each H, which implies 

Coq = 0, 

finally 

C = 0. 

Then (|^) implies that the derivative of / is symmetric (i.e. the antisymmetric 
derivative is zero), hence I is a derivative of some function (f> : M — ► R: 

l = D(f> , l k = -^-r. 

OX K 

Thus the first term in expression (||) of the Lagrangian is a full time-derivative, 
which can be omitted. We conclude 

(15) £(x, u) = (p(u). 

Now it is clear that we needed not only translation invariance, but rotation 
invariance as well in order to exclude spacetime dependence of the Lagrangian. 
Writing (H) again, now we get 



r> f \ w dw(H,x) dip 8uj i 

(16) D<p(u).H.u=— u , _JJ,^ = _ U 



First we consider maps H, for which H\e = 0. For each of these maps, there 
exists at)£ j, such that H ■ u = v for all u 6 V{\). Then 

_ , . dw(H, x) dip duj { 
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The left hand-side does not depend on x, and contains v linearly. Hence the 
same holds for the right hand-side as well. Therefore there is a linear map 
A : M E* = -pllp such that 

Bip(u) = A-u , — — = A m u l . 

ou a 

Let Ae denote the restriction of A onto y; then fixing an element c of V(l) 
and putting a := Ac, we have 

Dip(u) = A E ■ (u - c) + a , = A aff {u - c p ) + a a . 

Differentiating by uP , we infer from Young's theorem that Ae is symmetric i.e. 
A a fj = Ap a and a simple calculation results in 

ip(u) = —{{Ae ■ {u — c)) • (it — c) + a ■ {u — c) + const.. 

The last terms here are full time-derivatives, which can be omitted. Using this 
form let us return to formula (Q): 

(At \\w du:{H,x) 
(Ae(u — c)) ■ H ■ u = 



dx 

The left hand side is zero at u = c, thus the right hand side, too, hence the 
previous formula can be written as 

{A E ■ (it - c)) • H ■ {u- c) = g{H, x) ■ (it — c) . 

The left hand side is bilinear in u — c, the right hand side is linear; this is 
possible only if the right hand side is zero. Note that here in fact the restriction 
of H onto E appears; the well-known properties of the antisymmetric maps 
H\ E : E -> E imply that {A E ■ {u - c)) • H ■ {u - c) =0 can hold for all H 

only if Ae{u — c) is parallel to {u — c) i.e. there is an m € — — — (mass value) 
such that ^ 

tp{u) = —m\u — c| 2 . 

2.6 Reformulating the variational principle 

We show a method which we could have used but which would not have had 
any advantage. We only introduce this method here because we need to apply 
it later in the relativistic case, and we can show similarities to that case. 

A world line of a point-mass has been given as the range of a world line 
function so far. This is in fact not necessary, since we can parameterize such 
a curve in many different ways. Let 1 be any fixed positive element of J, and 
for a world line function r, let p : [0, 1] — > Ranr be a parameterization for 
the corresponding world line. By the inverse function theorem, a : = r _1 op : 
[0, 1] — > [to, t{\ is a continuously differentiable bijection, and 

p = (r o <t)<7, 
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for which applying r we obtain 

< t o p = &. 

The function a is given by p (as a primitive function of Top), hence r is also 
determined by p. We have 

r o a = p , r o a = . 

Top 

and 

p pT op 



(t o p) 2 (t o p) 3 

holds as well in case of twice differentiability. We see that r satisfies a second- 
order differential equation if and only if p does so. 

The action according to this new parameterization can be computed using 
integral transformation: 



(17) / £(r(t), r(t)) df= / fl(r(«7(a)), f(a(a)))d(a) da = 

to 

1 

= Jz(p(a), -P^j T -p(a)da. 
o 

According to the new parameterization, p is allowed to be an arbitrary future- 
like vector, not only an element of V(l). Let C denote the set of future- 
like vectors. This is an open set containing V(l). By the integral transformation 
above, the Lagrangian can be considered as a function 

TjJ 

£ : M x -»■ j , (x, w) h-> £{x, w) 
having the property 

£(x, w) — 2 (x, ) t ■ w. 

\ t ■ w / 

Hence the Lagrangian £ : M X 7(1) — > j used so far is now extended to a 
function M x — > j according to jl7| ) ; the action itself only depends on the 
world lines, not on their parameterization. 
Now we consider the vector space 

V := {p : [0, 1] — > M | p is continuously differentiable, p(0) = p(l) = 0}, 

endowed with the norm 

||p||:= max (\p(s)\ M + |i>(s)U) 
«e[o, l] 1 

and the affine space 
V := {p : [0, 1] — > M | p is continuously differentiable, p(0) — xq, p(l) — xi} 
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over V . For the action 

1 

S : V^R , J £(p(s), p(s)) ds, 

o 

we can repeat everything we said before, and its extremal points give the ex- 
tremal points of the original action function. 

The original Lagrangian is a restriction of this new one to the set M x V(i). 
Hence it is clear that the extended Lagrangians £ and £' differ by a full time- 
derivative if and only if their restrictions do so: £'(x, u) — £(x, u) = D(p(x)u 
for all (x,u) S M X V(l) is equivalent to £!{x, w) — £(x, w) = Dip(x)w for all 
(x, w) S M x N^. By definition (J3J) of the symmetries, it is clear that we call 
F a symmetry of the extended Lagrangian, if 

£(Fx, DF(x)w) = £(x, to) + f F (x, w) ((x, w) e M X N^). 



3. Point-mass in relativistic spacetime 
3.1 The relativistic spacetime 

This model is also introduced and described in details in Matolcsi |jl2). The 
relativistic spacetime is (M, /, g), where 

- M is spacetime, a four-dimensional oriented real afnne space over the vector 
space M. 

- I is the measure line of time intervals, an oriented one-dimensional real 

vector space. 

- g is an M x M — > I ® I arrow-oriented Lorentz form. 

With the use of the speed of light, physical distances can be identified with time 
intervals, hence there is no need for a new measure line besides I (we use the 
"unit system" % = c = 1). 

The motion of a particle is described by a world line, a connected curve with 
timclike tangents. A world line is naturally given as the range of a world line 
function. The latter is a continuously differentiable function r : I — > M defined 
on an interval, and 



r(/i-r ( i): {ue™ 



-l,u is future- like 



An essential difference is, compared with the non-relativistic case, that V(X) is 
not an affine space here. 

The time passed along a world line can be measured as follows. Let xq and 
x\ be two points of a world line C . We consider an arbitrary parameterization 
p of this world line, having for simplicity the domain [0, 1] and values p(0) = 
xq, p(l) = x\. The time passed between these two points on the world line C 
is 



t c (x , xi) = / y/\p{a)p(a)\ da. 
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Especially, if the parameterization is the world line function r, then tc{xo, %i) = 
r^ 1 (x 1 ) - r^ 1 (x ). 

3.2 Lorentz and Poincare transformations 

Transformations preserving the structure of the spacetime, i.e. the automor- 
phisms of spacetime are called the proper Poincare transformations. They can 
be described as follows. 

£ : = {L g Lin(Af, M) \ L is orientation and arrow-orientation preserving, 



is the proper Lorentz group, and 

V : = {!/ : M — > M \ L is affine, and the underlying 

is the proper Poincare group. The word "proper" relies to the absence of time 
or space inversion in these groups. Spacetime translations, i.e. transformations 
of the form x ^ x + a (with a 6 M) are Poincare transformations whose 
underlying linear operator is the identity of M. 

C and V are a six dimensional and a ten dimensional Lie group. They have 
the Lie algebras 



La(£) = {H G Lm(M, M)\H + = -H) and 
La(P) = {H : M -> M \ H is affine, and the underlying H G La(£)}, 



respectively. 

The Lie algebra of the subgroup of spacetime translations consists of ele- 
ments H for which H = 0. Such an affine map is a constant, i.e. there is an 
h e M, such that Hx = h for all x G M. 

Every Poincare transformation L in the neighborhood of the unit element 
idM has the form 



for some s G R and H G La(£). The underlying Lorentz transformation is 



3.3 The variational principle in relativistic spacetime 

In a naive way, we would say that given a continuous function £ : MxF(l)^ 
j, a point-mass propagates from a spacetime point xq to another one x\ along 
a world line, on which the action 



L+-L = id M } 



(18) 




71=1 





-1 



(19) 
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is extremal. But this leads to a very serious problem. Along different world lines 
connecting xq and x\ , different amount of time passes, hence the domains of such 
world line functions are not common. The set of such world line functions can 
not be made an affine space directly, we cannot use the methods of differential 
calculus. Would we step over these problems somehow, there would be another 
problem with the Euler-Lagrange equation: V(l) is not an affine space, hence 
differentiating by the speed variable would only be possible using the theory of 
manifolds. 



We help this problem by the method shown in section |2.6| . World lines will 
be parameterized by an arbitrary parameter having domain [0, 1] C /. For 
a world line function r, let p : [0, 1] — > Ranr be a parameterization of the 
corresponding world line. Then a : = r^ 1 op : [0, 1] — > [to, t\] is a continuously 
diffcrentiable bijection (to — r^ 1 (xo) and t\ = r~ 1 (xi)), and 



p = (r o a)a , < \p\ : = y/—p ■ p = &. 

Therefore, a is determined by p (as the primitive function of \p\), r can also be 
restored from p, and 

P 

r o a = p , r o a = — ; 

\P\ 

in case of twice differentiability we have 

P , Pip-P) 



r o a 



N 2 \i>\ 4 



We see that r obeys a second-order differential equation if and only if p does so. 
We allow any parameterization as described above, hence its derivative can be 
any future-like vector, not only an element ofV(l). Let C ^ be the set of 
the future-like vectors. This is an open set, containing V(l). 
We introduce the vector space 

V := {p : [0, 1] — > M | p is continuously differentiable, p(0) = p(l) = 0} 

endowed with the norm 

||p||:= max :(\p(a)\ M + \p(a)\sf) 

and the affine space 

V := {p : [0, 1] — » M \ p is continuously differentiable, p(0) = xq, p(l) — x±} 
over V. 

We are given a twice differentiable Lagrangian 

£ : M x -> j , (x,w)>-^ £(x, w), 

satisfying 

(20) £(x,w)=&L,-^A\w\. 
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We assume that the particle moves in such a way that the parameterization p 
of its world line between points xq and x\ in spacetime makes the derivative 
DS(p) of the action 



S : V -> K , p« £(p(s), p(s))ds 



zero. Equation (EOj) is trivial in the physically relevant cases w <E V(l), since 
\w\ = 1 for such speed values. Hence ( po|) only gives properties of the exten- 
sion of the Lagrangian from V(l) to , it does not mean any extra physical 
condition on the Lagrangian. By (EQ) and by replacing the integral variable, it 
is easy to show that the action is independent of the parameterization p of the 
world line. 



We can repeat all our arguments we had in section 2.3; the parameterization 
p of the world line realized satisfies the appropriate Euler-Lagrange equation. 
The time scale invariance property of special relativistic point-mass dynamics 
is a well known problem of mechanics. The Lagrangians satisfying (^0|) arc 
sometimes called "homogeneous" and this is why one cannot formulate Hamil- 
ton equations and canonical transformations in special relativistic mechanics 
without any further ado i. e. the formal Hamiltonian defined analogously to 
the non-relativistic case is identically zero (see e.g. From our treatment 

one can see clearly that the scale invariance property cannot be avoided, con- 
trary to the opinion of Goldstein R p329]: "... it is not a sacrosanct physical 
law that the action integral in Hamilton's principle must have the same value 
wether expressed in terms of t of in terms of ... All that is required is that L 
be a world scalar that leads to the correct equations of motion." If we gave a 
clear meaning to the expression " leads" and without knowing it we looked for a 
"correct" equation of motion, then we should exclude every Lagrangian where 
the action would depend on the parameterization of the world lines. 

The d efini tion of symmetries (^) and the scale invariance property treated 



in section 2J3 was a somewhat artificial construction, because in non-relativistic 
spacetime the time is absolute. However, the striking similarity of the non- 
relativistic and relativistic concepts deserves the attention. 



3.4 Symmetries and the Lagrangian 

We call a full time-derivative a function f ; M X — > j, if there exists a 
function <j> : M — ► K such that f(x, w) — D(f>(x)w. Two Lagrangians determine 
the same variational principle (are equivalent), if and only if their difference is 
a full time-derivative, i.e. the derivative DS connected to them agree. 

We say that a continuously differentiable map F : M — > M, for which 
~DF(x)w £ for each w £ N~*, is a symmetry of the (homogeneous) La- 
grangian, if there is a full time-derivative fp, such that 

&(Fx, BF(x)w) = £(x, to) + f F (x, w) ((x, w) G M X AT*). 

Especially, a proper Poincare transformation L is a symmetry of £, if 

Z(Lx, Lw) = Z[x, w) + ft, (a:, w). 
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3.5 Lagrangian of the free point-mass 

Similarly to the non-relativistic case, we call the point-mass free, if each Poincare 
transformation is a symmetry of the corresponding Lagrangian £. Hence £ is 
the Lagrangian of a free point-mass, if and only if for all L G V there is a 
function <pL : M — ► R such that 

£(Lx, Lw) — £(x, w) = D4>l{x)w. 

We introduce the notation <fi(L, x) : — (f>i,(x), and we assume that this function 
V x M — > R is smooth enough. Although <f> is defined on the manifold V x A/, 
we only consider one-parameter subgroups of V, hence we can use our usual 
differentiability notions. 

We consider transformations in a neighborhood of I : = id^f in the form 
18b. We obtain 



■{H, w) = : w 

L=I ° X 



(21) 



d£(x, w) dZ{x, w) d 2 (f>(L, x) 

■H(x)-\ H-w - 



dx dw dLdx 



the same way as in the non-relativistic case. We write the coordinated form as 
well: 



mi i . i\ 92 TT , . du! 



As in the non-relativistic case, we find 

(22) £(x, w) = l(x) ■ w + <p(w) , £ = l{x) k w k + <p(w), 

with an arbitrary function ip : V(l) — > R/J and with I : M — » M* for which 

C : = D A I : = (DZ)* - DZ = const. , C kl : = - -4 = const. 

By considering general Poincare transformations and substituting the form 
2|) to (^lb, we conclude 

dl(x) ■ w __, , ,, . „ dw(w) duj(H,x) 
ox dw ax 

i.e. 

§^(^^ + h*) + V?W + gtfW = 
Differentiating the latter by x m leads to 

—w K (H t 1 x 3 + h l ) + —^w K H % m + H l k w K = - —rw K . 

dx m dx l 3 dx 1 dx m dx m dx k 

By the method shown in the non-relativistic case, it follows that 

CkiH l m — C m iH l k = 0. 
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H is antisymmetric and, by the identification M* = subscripts and 

superscripts can be interchanged, hence H % k = —H k i = —Hk- Using also 
antisymmetry of C, 

f~i TTl TT if! n 

i.e. 

[C, H] = 0. 

for all antisymmetric H . As a consequence, C commutes with all proper Lorentz 
transformations, hence C is a multiple of idjvi by Schur's lemma. On the other 
hand, it is also antisymmetric, thus 

C = Q , C ik = 0. 

Based on this, as in the non-relativistic case, we conclude that the Lagrangian 
has the form 

£(x, w) = <p(w) 



by omitting a full time-derivative. According to (20) we have 



a: 



w 



Choosing an arbitrary number A > and using the previous equality, we obtain 

tp(Xw) = f\^jn \ Xw \ = A<y3 (]^|) l w ' = At ^M' 

differentiating with respect to A and substituting A = 1, we get 

dip 

(23) Dw(w) ■ w = <p(w) , — — j-w k — <p. 

Since the Lagrangian does not depend on spacetime points, considering also 
the equivalence of the Lagrangians by a full time derivative, its symmetry is 
formulated as follows: for all Lorentz transformations L there exists an a{L) 6 
M* such that 

(24) ip(Lw) = <p(w) + a(L) ■ w + const.. 
We differentiate this equation with respect to w: 

Bip(Lw)L = D(p(w) + a(L) , ^-{Lw)L\ = J^M + a{L) k . 

Considering a Lorentz transformation for which Lw = w holds, we have 
a(L) = by (|2J), hence for all such transformations Dip(w)L = T)ip(w), thus 
L*Dip(w) = Dtp(w). By L* = L , this means 

D(p(w) = LDtp(w) if Lw = w. 

Since {L \ Lw = w} is just the rotation group of the three-dimensional subspace 
orthogonal to w (i.e. it is a "little group" of Wigner), the equation above can 
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only hold in case Dip(w) is parallel to w. This means that there exists a function 
(3 : M -> R such that 



Dip(w) = (3(w)w , — j: = /3 w k , 



dp 

hence multiplying ( p3| ) with to, we obtain 

k a k dip k 

tfWi = — rffl Wi = fjw k W Wi= — rWfc W , 

thus the differential equation 

<p(w)w dp ipwi 



Dp(w) 



w ■ w dw l w k Wk 



holds for p. This partial differential equation can be handled with methods of 
ordinary differential equations. For the zeroth variable for example, by fixing 

3 

all the three other variables, with notations x : = w° — —wq, a 2 : — ^ w t Wi, 
we have the ordinary differential equation 

dtp —tpx tpx 



dx —x 2 + a 2 x 2 — a 2 ' 

or for the first variable, with fixed other variables and with notations y : = w 1 
u>i, b 2 := -(w°too + w 2 W2 + w 3 W3) = —w ■ w + w 1 u>i > 0, 

dtp ipy 



dy y 2 — b 2 



Then it is easy to see that the solution of (23) is 

p(w) = m \w\, 
with a constant m G j (mass value). 



3.6 Discussion 

We obtained the Lagrangian 

£(x, u) = —m\u — c\ 2 

of a non-relativistic free point-mass (with a mass value m and an absolute veloc- 
ity c) by considering Noether transformations (members of the inhomogeneous 
Galilean group) of form e sH but it is a simple fact that every Noether trans- 
formation is a symmetry of the Euler-Lagrange equations based on the above 
Lagrangian. 

We remark that different c-s correspond to different but equivalent La- 
grangians, while different m-s result in inequivalent Lagrangians. The La- 
grangian given by c is the kinetic energy of the point-mass, relative to the 
inertial observer having absolute velocity c. 

The Lagrangian does not depend on spacetime points, hence is invariant for 
spacetime translations. On the other hand, it is not invariant for all Noether 
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transformations: if L is a Noether transformation whose underlying linear map 
is the special Galilean transformation L v with speed v, then 

£(Lx, Lu) = —m\(u + v) — c| 2 = —m\u — (c — v)\ 2 , 

i.e. the Lagrangian turns into the (equivalent) Lagrangian given by c—v € V'(l). 

As a consequence, we see that if we required that the Lagrangian itself be in- 
variant for all Noether transformations, then we should get a constant function. 
This fact shows well, why our symmetry definition is preferable among all oth- 
ers. Evidently, we cannot require the invariance of the action, because it can be 
different for the very same Euler-Lagrange equations, a full time derivative gives 
an additional constant (in field theories it is excluded by appropriate bound- 
ary conditions). We cannot require the invariance of the solutions, because the 
gained freedom is too large pl| . That we really require is the invariance of the 
Euler-Lagrange equations that appears as an invariance of an equivalence class 
of the Lagrangians. 

We emphasize that spacetime homogeneity (invariance for spacetime trans- 
lations) alone does not imply that the Lagrangian does not contain explicit 
spacetime dependence, contrary to usual statements ||]: spacelike rotations, 
too, are necessary to deduce this result. 

We give a simple counterexample. Choosing an "origin" o £ M and an 
antisymmetric linear map B : M — > M* , we define the Lagrangian 

Z(x, u) := (x — 6) ■ B ■ u + <p(u) 

according to (||). For B ^ 0, the first term is not a full time-derivative, hence 
£ and all Lagrangians in its equivalence class depend explicitly on spacetime 
points. On the other hand, a spacetime translation with any a G M is a 
symmetry of this Lagrangian: 

Z{x + a, u) ~ £(x, u) = a ■ B ■ u 

and the right hand-side is a full time-derivative (of the function x t—> a ■ B ■ 
(x — o)). The statement that spacetime homogeneity implies independence of 
£, of spacetime variables is hence not true. This Lagrangian is symmetric to 
spacetime translations, and contains an essential spacetime dependence. The 
Euler-Lagrange equation has the form 

2B-r- Dip(f) ■ f = 0, 

which is clearly invariant under spacetime translations but momentum is not 
conserved. 

We obtained the Lagrangian 

£(x, w) = m \w\. 

of a relativistic free point-mass (with a mass value m) by considering Poincarc 
transformations of form e sH but it is a simple fact that every Poincare trans- 
formation is a symmetry of the above Lagrangian. 

Lagrangians with different to-s are inequivalent. The Lagrangian itself and 
not only its equivalence class is invariant to all Poincare transformations. There 
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are no other possible choices with our assumptions. Goldstein [Q admits L — 
mf(\w\) with a two times differentiable monotonous / : K — » j as a Lagrangian 
of a free point-mass. However, he requires the symmetry of the solutions of 
Euler-Lagrange equations in a restricted sense and admitting this possibility 
one can get far more general Lagrangians (see |2l[] on the possibilities). 

As in the non-relativistic case, spacetime homogeneity (invariance for space- 
time translations) alone does not imply that the Lagrangian does not contain 
explicit spacetime dependence. The above counterexample for spacetime depen- 
dence and non-conservation of momentum can be repeated word by word. 

The relativistic action depends only on the parameterized world line, not on 
the parameterization itself. We conclude that the action is in fact m times the 
proper time passed along the world line which is usually stated as "obvious" . 
The Lagrangian is constant on the absolute velocity values, i.e. on the set M x 
V(l): £(x, u) = m if u 6 V^(l). According to ([l9|), the particle moves between 
two timelike-separated points by minimizing its proper time passed. This also 
shows problems with definition (|l9|): a constant Lagrangian leads to a trivial 
Euler-Lagrange equation, which holds for any motion. We do not have hence 
Euler-Lagrange equation for world line functions. In || for example, variational 
principle is written for motions parameterized by the time of an observer. 

For the parameterization p of a world line of a free point-mass, we have the 
Euler-Lagrange equation 

P \ * • P 

Try = , i.e. = constant, 

\P\J \P\ 

hence the particle has constant velocity, the corresponding world line is a straight 
line. 

Let us mention here, that the parameterization invariance of the relativistic 
Lagrangian leads to the famous difficulties of relativistic dynamics and con- 
nected to degeneracy and non-covariant property of the traditional (pseudo) 
energy-momentum in general relativity as well. In general relativity one can 
find examples to the extension of the Lagrangian (locally from T^(l) to iV - * 
with our notation) p^] , but a different possibility is to work with Dirac's for- 
malism based on constrained variations j|, 

Our treatment shows clearly that variational principles and symmetries in 
both the non-relativistic and the relativistic case can be handled in a very similar 
way; most of our arguments are identical almost word by word. We have got 
the well-known results in a rigorous way. The spacetime models and the exact 
formulation of the problem helped us to see the proper reasons of these well- 
known results. 
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